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This paper is an attempt to construct a programmatic framework for the evolution of human language. First, we pres¬ 
ent a novel characterization of language, which is based on some of the most recent research results in linguistics. As 
these results suggest, language is best characterized as a specialized communication system, dedicated to the expres¬ 
sion of a surprisingly constrained set of meanings. This characterization calls for an account of the evolution of lan¬ 
guage in terms of the interaction between cultural and genetic evolution. We develop such an evolutionary model on 
the basis of the mechanism of culturally-driven genetic assimilation. As we show, a careful analysis of the diverse 
effects of this mechanism derives some of the most crucial properties of the evolved linguistic capacity as a specific, 
functional communication system. 
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1. Introduction 

In dealing with the evolution of human language, we 
are eonfronted with two fundamental questions. The 
first is the question of the object of the evolutionary 
proeess: what is language? What type of cognitive 
capacity is it? What is its functionl What is the rela¬ 
tionship between its llinetion and its strueture? The 
seeond question is that of the evolutionary process'. 
how did the human speeies get to master this inered- 
ibly eomplex eapaeity, whieh, more than any other 
trait, distinguishes it from all other speeies? These 
two questions are indelibly interdependent: on the 
one hand, different answers to the object question, 
i.e., different eharaeterizations of language as a eog- 
nitive eapaeity, naturally lead to different answers to 
the process question: what we think about language 
delimits what we ean say about its evolution. On the 
other hand, what we think about evolutionary pro- 
eesses may suggest that some answers to the object 
question are more reasonable than others. 

As far as the object question is eoneerned, lin¬ 
guists and eognitive psyehologists usually subseribe 
to one of two traditional, and radieally opposite. 
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views. Unfortunately, both of these views seem to 
lead to a theoretieal dead-end with respect to the 
question of the evolutionary process. The first tradi¬ 
tional view is the structuralist one. In the second 
half of the 20th century, this view has mostly been 
associated with Noam Chomsky’s theory of Gener¬ 
ative Grammar. For Generative Grammarians, lan¬ 
guage is first and foremost an extremely complex 
and subtle structural system: as native speakers, we 
seem to know much more about the structure of our 
language than we are aware of, and it is this implicit 
knowledge which Generative Grammarians are try¬ 
ing to describe and explain. A very famous example 
of this type of knowledge is that associated with the 
phenomenon of island constraints. Speakers of Eng¬ 
lish, as well as many other similar languages, know 
that the questions in (1 a) and (2a) are grammatical, 
whereas those in (lb) and (2b) are ungrammatical: 

(1) a. Who did the girl kiss? (Answer: The girl 

kissed the boy who delivered the pizza.) 
b. *What did the girl kiss the boy who deliv¬ 
ered? (Answer: The girl kissed the boy who 
delivered the pizza.) 

(2) a. Who did your obsession with Madonna an¬ 

noy? (Answer: Your obsession with Madonna 
annoyed your father.) 
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b. *Who did your obsession with annoy your fa¬ 
ther? (Answer: Your obsession with Madon¬ 
na annoyed your father.) 

Characteristically, speakers who are presented with 
examples such as the above for the first time are gen¬ 
uinely surprised: nobody (apart from linguists) ever 
talks about such facts, and parents and teachers do 
not teach them to their children and students. Yet all 
speakers share the same clear grammaticality judge¬ 
ments concerning these examples, and many others. 
Two crucial questions, then, need to be asked about 
such examples and their likes: first, what type of 
knowledge do speakers possess that determines their 
grammaticality judgements? Second, how did the 
speakers get to possess this knowledge? Chomsky’s 
answer to the first question is purely structural and 
formal. Speakers have a complex mental representa¬ 
tion of the formal properties of grammatical struc¬ 
tures, and these properties determine their gram¬ 
maticality judgements. Take a look at the answers to 
the questions in (1) and (2). The noun-phrase the 
pizza, which is topic of the question in (lb), is struc¬ 
turally embedded within a larger noun-phrase, the 
boy who delivered the pizza. The noun-phrase Ma¬ 
donna, which is the topic of the question in (2b), is 
structurally embedded within the noun-phrase your 
obsession with Madonna. Informally, Chomsky’s 
theory states that structural configurations of certain 
types (in our case, a noun-phrase embedded within a 
noun-phrase) disallow the formation of questions 
such as (lb) and (2b). In the answers to the questions 
in (la) and (2a), on the other hand, the noun-phrases 
which refer to the topic of the question are not em¬ 
bedded within other noun-phrases. Their questions 
are thus perfectly grammatical. Chomsky’s answer 
to the second question, that of the acquisition of this 
implicit, formal knowledge, is the following: chil¬ 
dren cannot possibly learn this type of knowledge 
from their linguistic environment, which means that 
they must come to the world with this knowledge 
encoded in their genes. Generative Grammar thus 
postulates innate knowledge of a rich set of gram¬ 
matical rules and constraints, highly abstract and 
complex. This set is called Universal Grammar. 

This characterization of the cognitive basis of 
language, as an innately-given set of formal rules 
and constraints, suffers from at least four major 
problems. Taken together, these problems strongly 
suggest that Chomsky’s framework should not be 


adopted as the basis of a theory of the evolution of 
language. First, Chomsky’s specific formal theories 
suffer from serious empirical problems. As far as is¬ 
land constraints are concerned, for example, there 
are many attested cases where noun-phrase embed¬ 
dedness does not prevent question formation (cf 
Dean, 1991; Dor, 2000a). Second, the rules and con¬ 
straints of Universal Grammar are decidedly non¬ 
functional: they do not seem to be related in any real 
fashion to the function of language as a tool of com¬ 
munication. The constraint postulated to explain the 
distinction between (la, 2a) and (lb, 2b) should 
make this very clear: there does not seem to be any 
communicative reason why language should impose 
a constraint of syntactic embeddedness on question 
formation. As strange as it may sound to the non¬ 
linguist, Chomsky regards the communicative func¬ 
tion of language as totally contingent to the essence 
of linguistic knowledge. According to his view, 
what we are capable of doing with language (for ex¬ 
ample, transfer information, tell stories, ask ques¬ 
tions, make requests, and so on) cannot tell us any¬ 
thing significant about the structural properties of 
the linguistic system itself This state-of-affairs im¬ 
plies that a Neo-Darwinian theory of the evolution 
of language is unattainable in principle, because the 
Neo-Darwinian mode of evolutionary explanation 
demands a satisfactory functional characterization 
of the relevant evolved trait (in our case, the linguis¬ 
tic capacity) as a descriptive platform. This is why 
Chomsky himself insists that the very attempt to say 
anything meaningful about the evolution of lan¬ 
guage is a total waste of time. 

Different writers (e.g. Pinker and Bloom, 1990; 
Berwick, 1998; Newmeyer, 1998; Jackendoff, 
1999) have attempted to show that Generative 
Grammar does make some evolutionary sense, and 
Maynard Smith and Szathmary (1995) have adopted 
this strategy in their chapter on the evolution of lan¬ 
guage. Two claims are usually involved in this at¬ 
tempt: some writers (most notably Pinker and 
Bloom, 1990) claim that even perfectly-arbitrary 
syntactic regularities are still adaptive to the extent 
that they are conventionally adopted by an entire 
community of speakers. This is so, because they es¬ 
tablish parity between speakers and listeners, send¬ 
ers and receivers of messages: an arbitrary conven¬ 
tion is better than no convention at all. Other writers 
(e.g. Jackendoff, 1999) maintain that some aspects 
of Universal Grammar, although not all of them, are 
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related to general cognitive predispositions, and are 
thus, at least to some extent, functional. These 
claims are problematic: the parity hypothesis is par¬ 
tially right - selection for parity is an important ele¬ 
ment in the construction of any communication sys¬ 
tem. The problem, of course, is that we know of no 
complex system of communication, in biology or 
elsewhere, which can be explained on the basis of 
parity independently of the particular function 
served by the system. In computer communication, 
for example, parity between the different communi¬ 
cating elements is extremely important, and there is 
a certain arbitrariness in the selection of one com¬ 
munication system over another in a specific config¬ 
uration. The point, however, is that each of the can¬ 
didate systems is perfectly functional on its own 
right: it is designed to communicate certain specific 
types of messages. The idea that a complex set of 
formal, arbitrary rules of syntax evolved on the sole 
basis of meaning-blind parity seems to us to stretch 
the credibility of the argument beyond a reasonable 
point. Jackendoff s argument may also have some 
truth to it, but to the extent that it does - it actually 
undermines Chomsky’s own characterization of lan¬ 
guage as a formal capacity and suggests a much 
more functionally-oriented view of it. Jackendoff s 
paper may thus have been written as an attempt to 
defend Universal Grammar, but we believe it can ac¬ 
tually be read as an interesting argument against it. 
At any rate, we believe that Chomsky, rather than 
his defenders, is actually right here: from the evolu¬ 
tionary point of view, his innateness claim cannot be 
reconciled with his specific characterization of lan¬ 
guage as a non-functional cognitive apparatus. 

The third problem with the generative framework 
has to do with the notion of innateness: Chomsky’s 
specific innateness claim - not the general notion 
that humans are genetically endowed with the ca¬ 
pacity to learn language, but the claim that specific 
grammatical rules and constraints are genetically 
encoded - cannot be reconciled with what we kn ow 
about brain structures', neuroscientists from differ¬ 
ent disciplines seem to agree that the brain is an or¬ 
gan of extreme plasticity and generality (cf Elman 
et ah, 1996; Deacon, 1997), which means that the 
chances of finding explicit representations of lin¬ 
guistic specificities innately encoded in brain tissue 
prior to acquisition are very slim. The fourth prob¬ 
lem is that Chomsky’s view of linguistic knowledge 
is extremely static and universalistic. The Genera¬ 


tive theory of principles and parameters is the most 
remarkable representative of this view: according to 
the theory, variability between different languages - 
the fine-grained differences, for example, between 
island constraints in different languages - are also 
encoded in the genes: children come to the world 
with a few parameters for each linguistic rule or con¬ 
straint, and choose the right one for the specific lan¬ 
guage they encounter. This conception faces some 
difficulties, which as far as we are concerned, seem 
pretty much insurmountable: (i) it cannot account 
for the obvious fact that languages are dynamic enti¬ 
ties which constantly change and evolve in their so¬ 
cial contexts - beyond the narrow margin of flexibil¬ 
ity allowed by the theory; (ii) it cannot easily 
account for the enormous amount of variability at¬ 
tested between different languages - again, beyond 
the theory’s scope; and (iii) it is forced to assume a 
genetic change, which enabled an individual in a 
community of hominids to use the full range of fu¬ 
ture languages in a situation when no languages ex¬ 
isted - and then assume that this hardly-fiinctional 
property spread in the whole community. Such a 
scenario either assumes a single mutation with a 
huge effect, or a whole series of initially-neutral mu¬ 
tations, all of which have become fixed in the popu¬ 
lation through genetic drift. Needless to say, such 
evolutionary assumptions are hardly defensible. 

The opposite traditional answer to the object 
question is the functional one: according to this 
view, language is simply a general-purpose tool for 
the communication of meanings. A corollary of this 
view is that grammatieal complexities in natural lan¬ 
guages are theoretically reducible to principles of 
general cognition. As far as island constraints are 
concerned, for example, functionalists usually at¬ 
tempt to explain them as manifestations of the cog¬ 
nitive processing of language: the ungrammatical 
sentences in (lb) and (2b) are somehow more diffi¬ 
cult to process than the grammatical ones in (1 a) and 
(2a) (Dean, 1991). According to this view, there is 
nothing cognitively unique about the linguistic ca¬ 
pacity: languages comply with the general con¬ 
straints imposed on human cognition, and children 
simply acquire their languages on the sole basis of 
external input - just like they do with respect to any 
other socially-based type of knowledge. As far as 
the evolutionary question is concerned, adherents of 
this functional view naturally concentrate on the so¬ 
cial foundations of the evolutionary process, and 
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equate the cognitive evolution of language with the 
evolution of general intelligence (Deacon, 1997; 
Dunbar, 1996). At first sight, this perspective may 
seem to fare much better than the Generative one, 
but it actually suffers from at least three problems, 
which make it very hard to defend. The first problem 
is, again, an empirical one: although the specific 
functional theories which attempt to reduce gram¬ 
matical complexities to general cognitive principles 
(e.g. Langacker, 1987; Wierzbicka, 1988; Dean, 
1991; Givon, 1995) may suggest first approxima¬ 
tions of the cognitive precursors of linguistic com¬ 
plexities, these first approximations do not usually 
get anywhere close to an explicit and precise expla¬ 
nation of the relevant specific phenomena. As such, 
they do not offer a real alternative to the Chomskian 
characterization of the object of evolution. Second, 
the very attempt to reduce language to general cog¬ 
nition is problematic: we have ample evidence - 
from the specific patterns of language acquisition 
and language breakdown, and from the formation of 
de novo languages, such as the sign language devel¬ 
oped by the deaf children in Nicaragua (Kegl et al., 
1999) - for the unique status of language as a highly 
specialized cognitive capacity. Whereas Chomsky’s 
theory captures the uniqueness of language at the ex¬ 
pense of its functionality, the functional theories at¬ 
tempt to salvage the functional aspects of language 
at the expense of its uniqueness. Third, the func¬ 
tionalists’ attempt to characterize language as & gen¬ 
eral-purpose communication tool does not really 
enhance our understanding of the function of lan¬ 
guage - it is simply too general: the claim that we 
use language “for communication” is similar to the 
claim that we use our visual system “for seeing”. As 
Marr (1982) so convincingly claimed, a functional 
characterization of any cognitive system should be a 
much more specific one. 

It thus seems that both traditional views lead us to 
a theoretical dead-end. What we need is a character¬ 
ization of language as a cognitive capacity that is 
both empirically viable and functionally specific. 
This characterization should be able to reconcile the 
domain-specificity of language on the behavioral 
level with the high level of plasticity of the brain, 
and account for both the obvious universality of lan¬ 
guage as a cognitive capacity and the equally obvi¬ 
ous fact that languages are dynamic, variable enti¬ 
ties. The most important challenge in here is that of 
the explication of the relationship between linguistic 


function and linguistic structure. Two questions 
have to be answered: first, can the seemingly non¬ 
functional complexities of grammar - those which 
have inspired Chomsky’s formal theories - be ex¬ 
plained in a functional and empirically viable man¬ 
ner? Second, can we define the general function of 
language in a non-trivial fashion? 

As we will show below, a long series of empirical 
results, accumulated in the last two decades by se- 
manticists and lexical semanticists, suggests that the 
answers to both of these questions lies in the nature 
of the semantic categories of linguistic communica¬ 
tion - the specific types of meanings which are ex¬ 
pressible through language (Dor, 2000b). On the one 
hand, we shall claim that grammatical complexities 
in natural languages are best explained on the basis 
of these specific types of meanings: grammatical 
complexities are neither autonomously formal (as 
the structuralists claim), nor reflections of general 
cognitive principles (as the functionalists claim), but 
structural reflections of linguistic meaning. On the 
other hand, it turns out that the types of meanings 
which are expressible through language constitute a 
very constrained subset of the types of meanings 
which can be thought, we can use language to ex¬ 
press only a fraction of what we can think and feel. 
In technical terms, semantic categories are a subset 
of conceptual categories. This state-of-affairs al¬ 
lows for the construction of a non-trivial functional 
characterization of language, as a unique and highly- 
constrained communication system, dedicated to 
the communication of this specifically restricted set 
of meanings. Technically, again, this communica¬ 
tion system is best described as a mapping-system 
between linguistic meaning and linguistic form. The 
representational level of Linguistic Meaning deter¬ 
mines the set of the semantic categories of linguistic 
communication; the representational level of Lin¬ 
guistic Form determines the structural properties of 
the speech-channel. 

This characterization of language as a cognitive 
capacity allows for a major re-framing of the evolu¬ 
tionary question, as the question of the gradual ex¬ 
pansion and sophistication of the linguistic map¬ 
ping-system, i.e., the expansion and sophistication 
of the set of semantic categories, their interactions, 
and their modes of mapping onto the speech chan¬ 
nel. The re-framing of the evolutionary question nat¬ 
urally calls for an answer in terms of the interaction 
between cultural evolution and genetic evolution. 
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The idea that the evolution of language eonsisted 
of a eomplex interaction between genes and culture 
has been discussed by Pinker and Bloom (1990), 
Maynard Smith and Szathmary (1995), Jablonka 
and Rechav (1996), Deacon (1997) and Kirby and 
Hurford (1997). In this paper, we shall attempt to de¬ 
velop and explicate this idea both in terms of the 
evolutionary properties of the interactive process 
and in terms of its linguistic object. 

As we shall claim, the process of cultural linguis¬ 
tic evolution consisted of the selection of, social 
agreement on, and cultural evolution of the semantic 
categories for linguistic communication, and the 
gradual sophistication of the mapping-system for 
these categories. In this long, gradual, and complex 
process, a social group gradually isolates certain as¬ 
pects of its epistemology, sharpens and develops 
them, reaches social agreement about them, and de¬ 
velops sophisticated structural means for communi¬ 
cating them within the community. Here we part 
way with Kirby (1999), who assumes the cultural 
evolution of the syntactic mapping-system on the 
basis of a pre-given semantic platform (see section 
3.2 for further discussion). Needless to say, the se¬ 
mantically-based cultural evolution of language is a 
permanent process, which goes on until today. As 
we shall claim, this cultural process was made possi¬ 
ble, throughout most of evolutionary time, by the 
great behavioral plasticity of hominids. This plastic¬ 
ity included all aspects of linguistic behavior: pro¬ 
duction, comprehension, acquisition and transmis¬ 
sion. 

Crucially, however, at various points in evolu¬ 
tionary time, the behavioral plasticity of the speak¬ 
ers was stretched by cultural evolution, to the point 
where differences in the ability to learn language be¬ 
came selectively important. At these points, genetic 
assimilation occurred - on all cognitive fronts. At 
every step of the way, linguistic culture constituted 
the selective environment in which genes that con¬ 
tributed to linguistic performance, acquisition and 
transmission, were selected. The interaction be¬ 
tween continuous, directional cultural evolution and 
partial genetic assimilation resulted in a consecutive 
set of evolutionary stages, in which the expressive 
envelope of language was expanded and sophisti¬ 
cated, and speakers were selected on the basis of 
their linguistic performance. We will claim that this 
process of cultural evolution and genetic assimila¬ 
tion gradually created what we think of as linguisti¬ 


cally-biased cognition: a cognitive make-up which, 
without encoding linguistic specificities on a genetic 
basis, is still biased towards rapid learning of the 
linguistic mapping-system. 

In section 2 of the paper, we present our charac¬ 
terization of language as a specialized communica¬ 
tion system. As this paper is mainly addressed to a 
biologically-oriented readership, the characteriza¬ 
tion will be presented in a decidedly informal and 
general fashion, and many of the controversial lin¬ 
guistic claims will not be argued for in a fully techni¬ 
cal and detailed fashion. A more elaborate discus¬ 
sion can be found in Dor (2000a, b, c) and in Dor and 
Jablonka (in press). In section 3, we provide prelimi¬ 
nary evidence for our assumption that both cultural 
and genetic evolution were involved in the develop¬ 
ment of this system. We then discuss the interaction 
between cultural and genetic evolution, and expli¬ 
cate the fundamental features of the mechanism of 
genetic assimilation. In section 5, we describe the 
process of the evolution of language, and suggest 
that some of the constitutive properties of the 
evolved system are actually derivable from the char¬ 
acteristics of the evolutionary process. In section 6, 
we conclude with a few remarks regarding the more 
general consequences of the analysis. 

2. On language as a cognitive capacity 

As we have already indicated, a long series of empir¬ 
ical results, accumulated throughout the last decade 
or two in the domains of semantics and lexical se¬ 
mantics, suggests that grammatical complexities in 
natural languages are best accounted for on the basis 
of a careful examination of the specific meanings in¬ 
volved - the meanings of the words, expressions and 
constructions which combine to create the relevant 
grammatical structures (cf Dowty, 1979, 1991; 
Jackendoff, 1983, 1990; Levin and Rappaport- 
Hovav, 1991, 1995; Frawley, 1992; Levin, 1993; 
Goldberg, 1995; Dor, 1996, 2000a, b, c; Alsina et 
al., 1997; Van Valin and LaPolla, 1997, and refer¬ 
ences therein). Note that this type of explanation 
runs counter both the structuralists’ and the func¬ 
tionalists’ views. To demonstrate, let us look at the 
following examples: 

(3) a. John broke the stick, 
b. The stick broke. 
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(4) a. John hit the tree, 
b. *The tree hit. 

Why is (4b) ungrammatical whereas (3b) is a per¬ 
fectly good sentence? The explanation lies in the 
meanings of the verbs break and hit. Break is what 
lexical semanticists call a change-of-state verb: it 
denotes an event, the event of breaking, in which an 
entity (in our case, the stick) changed its state as a 
result of some activity (in our case, John’s action). 
Hit, on the other hand, is a surface-contact verb: it 
denotes an event in which an entity (in our case, the 
tree) was touched on its surface by something or 
someone (in our case, John). Now, as it turns out, all 
change-of-state verbs behave like break (we can say 
‘the tree bent, folded, shattered, cracked)] all sur¬ 
face-contact verbs behave like hit (we cannot say 
*the tree slapped, struck, bumped, stroked and so 
on). The distinction between the patterns of gram¬ 
matical behavior is thus correlated with a parallel 
distinction between the meanings of the verbs in¬ 
volved in the construction. Different technical theo¬ 
ries have been developed throughout the years to ex¬ 
plain how the meanings of the verbs determine the 
grammatical patterns, and these theories will not in¬ 
terest us here. The crucial point for the purposes of 
this discussion is that we need to assume that speak¬ 
ers (of English, in this case) have an implicit mental 
representation of the semantic distinction between 
the types of events denoted by the different verbs. In 
other words, we have to assume that speakers men¬ 
tally categorize, or classify, events according to their 
semantic properties. As it turns out, speakers implic¬ 
itly classify events according to a very rich and de¬ 
tailed scheme of categories: thus, for example, 
events are classified in terms of their telicity. telic 
events (like the event of breaking a window) are 
bounded in time; they have a necessary end point 
(the event of breaking the window must include the 
point in time in which the window is broken - and 
that is where it necessarily ends). Atelic events (like 
the event of swimming in the pool) do not have such 
a necessary end point. Events can also be classified 
on the basis of their thematic structure (the number 
of their participants and their specific modes of in¬ 
volvement in the events); their epistemic status 
(whether we know for a fact that they took place in 
the real world, or we think that they did, or hope so), 
and so on and so forth. Each of these categorical dis¬ 
tinctions determines some aspects of grammar. 


Sometimes, grammatical complexities are deter¬ 
mined by a combination of two or more of these 
semantic categorizations. Needless to say, speakers 
are totally unaware of this relationship between se¬ 
mantic categorization and grammatical behavior. 

The semantic categorization of events is a major 
determinant of grammatical knowledge, but it is not 
alone: some grammatical phenomena seem to be de¬ 
termined by other schemes of semantic classifica¬ 
tion. We classify entities in the world, for example, 
on the basis of a set of categories, including animacy 
(dogs and people are animate] chairs and tables and 
are inanimate), gender {male and female), and 
countability (you can count bottles and people; you 
cannot count rice and water). Each of these categor¬ 
ical schemes, and others, determines some gram¬ 
matical patterns in natural languages. Some other 
classifications are more abstract: we classify times 
{past, present, future)] spatial relations (an entity is 
behind, above, below another entity); and logical re¬ 
lations {negation, conditionals). These, too, deter¬ 
mine grammatical patterns. As our understanding of 
these schemes of categorization deepens, more and 
more grammatical complexities turn out to reflect 
them in interesting ways - including those complex¬ 
ities which have for the last forty years been thought 
of as the ultimate proof of the formal, non-functional 
nature of grammar. In a recent paper, for example. 
Dor (2000a) develops a semantically-based account 
of island constraints, which we have mentioned in 
the beginning of the paper. Here are the examples 
we have looked at: 

(5) a. Who did the girl kiss? (Answer: The girl 

kissed the boy who delivered the pizza.) 
b. *What did the girl kiss the boy who deliv¬ 
ered? (Answer: The girl kissed the boy who 
delivered the pizza.) 

(6) a. Who did your obsession with Madonna an¬ 

noy? (Answer: Your obsession with Madonna 
annoyed your father.) 

b. *Who did your obsession with annoy your fa¬ 
ther? (Answer: Your obsession with Ma¬ 
donna annoyed your father.) 

Dor’s (2000a) technical account derives these phe¬ 
nomena, together with a wide range of related facts 
from a set of formal semantic considerations, which 
we shall not deal with here. For our present pur- 
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poses, it will suffice to take an informal look at the 
semantic underpinnings of the above simple exam¬ 
ples. The sentences in (5) and (6) are interrogative 
sentences: they are used for asking questions. When 
a speaker asks a question of this type, he or she actu¬ 
ally performs a rather complex speech-act: the 
speaker (i) tells the interlocutor that a certain event 
occurred (or may have occurred, or did not occur, 
and so on), and (ii) asks the interlocutor to provide a 
certain piece of information about this event. Thus, 
for example, when a speaker asks, “who did the girl 
kiss?”, he or she tells the interlocutor that “the girl 
kissed someone", and asks the interlocutor to pro¬ 
vide the identity of that person. When a speaker 
asks, “who did John say that the girl never kissed?”, 
he or she tells the interlocutor that, according to 
John, the event in which “the girl kissed a particular 
person” never occurred, and asks the interlocutor to 
provide the identity of that person. Technicalities 
aside. Dor (2000a) shows that speakers can ask 
questions concerning the entities which participated 
in the event they told their interlocutor about, and 
concerning the general properties of the event - its 
time, place, cause and so on. Crucially, however, 
speakers cannot ask questions concerning entities 
which did not participate in the event they reported. 
Now, as we have said, speakers know a great deal 
about the semantics of events, including the seman¬ 
tics of kissing and annoying events. Among other 
things, speakers know that both types of events in¬ 
volve two necessary participants: kissing events in¬ 
volve the kissing participant and the participant be¬ 
ing kissed', annoying events involve the annoyed 
participant and the annoying participant. Now, in 
(5a) and (6a), the speaker tells us about a kissing 
event, and an annoying event respectively, and asks 
about participants in these events'. (5a) is about the 
participant being kissed; (6a) is about the participant 
being annoyed. In (5b) and (6b), on the other hand, 
the speaker tells us about the very same events, but 
asks about entities which do not participate in them'. 
(5b) is a question about the pizza, which does not 
participate in the kissing event; (6b) is a question 
about Madonna, which does not participate in the 
annoying event - the father is annoyed by the obses¬ 
sion, not by Madonna herself (5b) and (6b) are thus 
ungrammatical because they violate a semantic con¬ 
straint on linguistic communication. Rather than 
constitute the prototypical example of the formal 
character of grammatical complexities, island con¬ 


straints actually demonstrate the functional nature 
of grammatical structures. 

Now, the following point is crucial: in principle, 
it seems that we may classify our knowledge of the 
world (our knowledge of events, entities, properties 
and so on) on the basis of an infinite number of cate¬ 
gories. Indeed, when we perceive the world, think 
about it, or have feelings about it, we use a very 
large, diverse and constantly-changing set of cate¬ 
gories: we may, for example, categorize people on 
the basis of the categorical distinction between 
friend and/oe; we may classify physical entities on 
the basis of their practical utility, or their price', we 
may categorize species as endangered or not; and so 
on and so forth. We usually classify events as inter¬ 
esting or boring', and we distinguish between events 
in which someone we know participated, and events 
in which only strangers took part. It would seem rea¬ 
sonable, therefore, to expect that grammars of natu¬ 
ral languages would reflect this infinitely large and 
diverse set of categories, or at least that different lan¬ 
guages would reflect radically diverse subsets of 
these. Crucially, however, a survey of the world’s 
languages reveals a very surprising fact: languages 
are definitely not all alike, but the semantic catego¬ 
ries which are reflected by grammatical complexi¬ 
ties in natural languages belong to a very con¬ 
strained subset of all the categories which we can 
use to think, feel and conceptualize about the world'. 
some semantic categories turn out to be grammati¬ 
cally marked in language after language, whereas 
some others consistently do not participate in the 
grammatical game. Specifically, no language we 
know grammatically marks the distinction between 
friend and foe, or between interesting and boring 
events. The categorical distinctions between ani¬ 
mate and inanimate entities, telic and atelic events, 
factual and hypothesized events are reflected in vir¬ 
tually every language we know, and so are the dis¬ 
tinctions between different spatial relations and 
time configurations. (Note that different languages 
may relate to the same categorical scheme in differ¬ 
ent ways. Thus, for example, different languages 
carve up time in different ways: some languages 
mark the distinction between/wtMre and non-future', 
some mark the distinction between past and non- 
past, some have much more elaborated systems.) It 
seems as if all natural languages went through a very 
similar process of epistemic selection - where a spe¬ 
cific subset of all possible categorical distinctions 
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was isolated, highlighted and marked by grammars 
for the purposes of linguistic communication. 

Circumscribing the set of semantic categories 
which universally determine grammatical structures 
allows for the formulation of a non-trivial functional 
definition of language: language is not just a gen¬ 
eral-purpose communication system. It is a commu¬ 
nication system, structurally designed to communi¬ 
cate messages which are grounded in a specific and 
constrained categorical scheme. This categorical 
scheme is centered around a specific set of events 
and situations (not all types of events and situa¬ 
tions), their participants, their time and place, their 
properties, and some of the properties of their partic¬ 
ipants. Thus, these categories determine the expres¬ 
sive envelope of language - they determine which 
meanings, and which meaning combinations, are ex¬ 
pressible by the means provided by natural lan¬ 
guages. Again: the expressive envelopes of different 
languages are different in interesting and subtle 
ways, but they all share a common core. Types of 
messages which fall comfortably within this core are 
best suited for communication through language. 
Types of messages which do not comfortably com¬ 
ply with it turn out to be more difficult to communi¬ 
cate through language. Many other types of mes¬ 
sages, which do not comply with this scheme at all, 
turn out to be virtually impossible to communicate 
through language. Interestingly, many of the mes¬ 
sages whieh turn out to be very difficult to commu¬ 
nicate through language seem to be very well suited 
for communication through other means of commu¬ 
nication: we can mime and dance them, use facial 
expressions and body language to express them, 
paint and draw them, write and play music, prepare 
charts and tables, write mathematical formulae, 
screen movies and videos, and so on. 

To take a simple example, think about a simple 
knot, a figure of eight. Suppose you know how to 
make the knot, and want to teach someone how to 
make it. There are at least two very simple ways to 
do this: you can either take a piece of rope and dem¬ 
onstrate the procedure, or draw a simple drawing 
which accomplishes the same goal. Suppose, how¬ 
ever, that you insist on communicating this knowl¬ 
edge through language. Now, the communicative 
task becomes surprisingly more difficult to achieve. 
You may want to formulate the following set of in¬ 
structions (from Aitchison, 1996): 


1. Pass the end of the rope over the standing 
part. 

2. Take the end under the standing part away 
from the loop. 

3. Bring the end of the rope back up over itself 
towards the loop. 

4. Pass the end down through the loop. 

5. Pull tight. 

This description of the procedure is accurate, but 
note that the cognitive effort needed to formulate the 
instructions, and the effort needed to understand 
them, are much greater than that involved in the 
face-to-face demonstration. This is so, beeause the 
message itself- the procedure of tying the knot - has 
to be reframed and broken down in order to comply 
with the categorical scheme of language. The in¬ 
structions break down the event of tying the kn ot 
into a series of smaller events, each of which sepa¬ 
rately meets the conditions set by the semantics of 
events. In each of these smaller events the first par¬ 
ticipant, an agent (the person holding the rope), 
causes another participant, the theme (e.g. the end of 
the rope) to move in space and reach a new position 
in relation to a third participant, the goal (e.g. the 
standing part, or the loop). This is a type of event 
which semantic categorization recognizes. Note, 
and this is a crucial point, that if you learn to tie the 
kn ot in non-verbal ways - from a picture, or by look¬ 
ing at someone doing it - you may never have to 
conceptualize about such entities as the end of the 
rope, the standing part or the loop. Neither will you 
have to conceptualize about the different stages de¬ 
scribed in the instructions. You will probably con¬ 
ceptualize about one entity (i.e. the rope itself, a sin¬ 
gle physical unit, the object which is tied into a knot) 
and one event (i.e. tying the rope). This event, how¬ 
ever, does not fall comfortably within the categori¬ 
cal scheme of language: specifically, the rope which 
participates in it plays both the role of theme and of 
goal in the event. A major component of event se¬ 
mantics does not seem to allow this: informally, lan¬ 
guage is not designed to describe events in which the 
same participant plays two different spatial roles. 

The linguistic communication of simple, practi¬ 
cal instructions of this type is more cumbersome 
than the communication of the same information in 
non-verbal terms, but, as we have seen, it is still pos¬ 
sible. In more complicated cases, such communica- 
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tion is virtually impossible. Think about learning to 
play basketball, practice karate, or play the violin, 
through the sole means of linguistic communication. 
Anyone who is engaged in these activities knows 
that language plays a rather negligible role in their 
instruction. They are best taught by mimesis (cf 
Donald, 1991). Moreover, emotional meanings are 
best communicated through facial expressions, 
body language, music and dance. Language may 
sometimes help in categorizing the emotion, but is 
rather helpless when it comes to complex, holistic 
emotions. As Deacon (1997) notes, “a rich and com¬ 
plex language is still no substitute for a shocked ex¬ 
pression, a muffled laugh, or a flood of silent tears, 
when it comes to communicating some of the more 
important messages of human social relationships”. 
And think about visual information. As Donald 
(1991) says, “visual thinking is now seen as largely 
autonomous from language”. Visual meanings are 
best communicated by visual means - pictures, 
drawings, hand waves - and this is another domain 
where language is practically useless: to convince 
yourself of that, make a simple, abstract free-hand 
drawing on a piece of paper, and try to verbally com¬ 
municate the drawing to a friend - without showing 
him or her the drawing itself, and without moving 
your hands about. As you will realize, this is incredi¬ 
bly difficult. You will probably be able to give your 
friend a general idea of the drawing, but you will not 
be able to provide a detailed description which will 
allow your friend to reduplicate the drawing. In this 
sense, a picture is really “worth a thousand words”: 
there are many types of visual meanings which sim¬ 
ply cannot be communicated through language. As 
Donald (1991) argues, then, “a significant part of 
normal human culture functions without much in¬ 
volvement of symbolic language. Examples are 
found in trades and crafts, games, athletics, in a sig¬ 
nificant percentage of art forms, various aspects of 
theater, including pantomime, and most social rit¬ 
ual. ... When humans lack language, provided they 
do not suffer some brutal brain lesion that robs them 
of other, more fundamental, cognitive skills, they 
can continue to participate in all those forms of hu¬ 
man culture that do not require language.” 

The last paragraphs concentrated on that set of 
meanings which are virtually inexpressible through 
language, but there is, obviously, a remarkable set of 
meanings which are exquisitely expressible through 
language - and this is where the other means of 


communication are hopelessly lost. Language al¬ 
lows us to perform a rich set of speech-acts - make 
claims, tell stories, ask questions, make requests, 
promise and threat - concerning events of different 
types, their participants, their properties, their 
epistemic status, their location and time, and so on. 
Informally, we use language to tell each other that 
we know, or think, or guess, that an event of a certain 
type occurred, in which someone did something to 
someone, at a certain place, at a certain point in time, 
for a specific reason. Or we can use language to ask 
each other whether such a event happened, or who 
was involved in it; or to promise that an event of a 
certain type will happen; or to request that such an 
event will take place; and so on. These meanings are 
very difficult to communicate through mimesis, and 
almost impossible to communicate through dance, 
facial expressions and so on. (We can communicate 
a general plea by a facial expression, but we cannot 
express a specifically detailed request, such as “Can 
you please drive me to the airport tonight? I need to 
catch a plane”.) 

Natural language, then, is a communication tool 
which is structurally designed for the communica¬ 
tion of a constrained set of meanings. The structural 
medium for the communication of this set of mean¬ 
ings is mostly the auditory channel - although sign- 
languages use the visual channel with the same level 
of efficiency. In terms of traditional cognitive sci¬ 
ence, this characterization of language is best cap¬ 
tured by the notion of a mapping-system: a cognitive 
system which maps representations of one type (in 
our case, meanings which fall within the expressive 
envelope of language) onto representations of an¬ 
other type (in our case, the set of grammatical mark¬ 
ers which are visible on the speech-channel). The 
following is a schematic representation of this view 
of language: 
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In this schematic representation, language is charac¬ 
terized as a transparent mapping-system between 
the levels of Linguistic Meaning and Linguistic 
Form. The representational level of Linguistic 
Meaning constrains the set of meanings which are 
expressible through language: as we have seen, a 
major set of Conceptual Representations is not lin- 
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guistically expressible. The level of Linguistic 
Form, which we have not discussed up to this point, 
includes all the structural tools which are visible on 
the speech-channel, and are used to mark linguistic 
meanings in natural languages.' phonological regu¬ 
larities, morphological markers, linear order, adja¬ 
cency, and so on. What is missing from this picture 
is abstract, non-functional, syntax, of the type envi¬ 
sioned by Generative Grammarians: as we have seen 
before, we have good reasons to assume that such 
syntactic representations are no longer needed. 
Note, moreover, that this schematic characterization 
allows for both linguistic variability and universal¬ 
ity, on all fronts: we know that a core set of semantic 
categories is universal, but different languages may, 
in principle, choose to occupy the level of Linguistic 
Meaning with different semantic categories, and dif¬ 
ferent categorical combinations. Different lan¬ 
guages obviously use different subsets of Linguistic 
Form to mark their meanings, but some such mark¬ 
ers are universal. Finally, different languages may 
map different semantic categories onto different 
markers; this is an essential property of the system. 
Thus, for example, thematic roles (the roles played 
by participants in events) may be marked by linear- 
order in some languages, and by morphological 
case-markers in others: in English, we know that 
John was the active participant in the event de¬ 
scribed by a sentence like John invited Bill for a 
meeting, because the noun-phrase John appears be¬ 
fore the verb; we know that Bill was the more pas¬ 
sive participant because it appears after the verb. In 
Russian, on the other hand, the noun-phrases John 
and Bill are marked by special morphological case- 
markers, which specify their mode of involvement 
in the event. Linear-order in Russian is thus much 
more flexible. 

This characterization of language immediately 
re-frames the question of the evolution of language. 
It is now neither the question of the evolution of a 
formal, meaning-blind system, nor the question of 
the evolution of a general-purpose communication 
system. It is the question of the evolution of a spe¬ 
cific communication system, dedicated to the com¬ 
munication of a constrained set of meanings by 
means of sound concatenation. In the above cogni¬ 
tive terms, it is the question of the evolution of a 
mapping-system: The gradual expansion and so¬ 
phistication of the representational levels of Lin¬ 
guistic Meaning and Linguistic Form, and their 


transparent mapping onto each other. As we shall 
show in the remainder of this paper, this novel evo¬ 
lutionary question calls for an answer in terms of a 
specific interaction between cultural evolution and 
genetic evolution. 

3. Cultural evolution and genetic evolution 

Like all other evolutionary processes, the process of 
the evolution of the linguistic mapping-system is a 
multi-faceted one. Different questions may be asked 
about it: we may, for example, inquire about the 
types of selection pressures that played a role in the 
evolution of language; or ask about possible stages 
in the evolutionary process, trying to draw a reason¬ 
able line of progression, or a branching pattern; or 
focus on the dynamic patterns of the process. In the 
following, we will suggest an answer to the last 
question, that of the dynamics of the process, and ad¬ 
dress the other questions only to the extent that they 
directly relate to it. We will suggest that the dynam¬ 
ics of the evolutionary process was that of the inter¬ 
action between cultural evolution and genetic evolu¬ 
tion. More specifically, we will present the back¬ 
bone of an evolutionary model, where the cultural 
evolution of language drives the process of genetic 
evolution through the mechanism of genetic assimi¬ 
lation. Before we delve into the model, however, we 
need to justify the very preliminary notion that both 
cultural and genetic evolution were involved in the 
process. Two questions need to be answered: first. 
Why do we need to assume a cultural component in 
the evolution of language in the first place? Why is 
genetic evolution not sufficient? Bickerton’s (1990) 
model of the evolution of language, for example, 
seems to ignore the role of cultural evolution alto¬ 
gether: for him, the evolution of language is the ge¬ 
netic evolution of a specific language organ. Why is 
that not enough? Second, why do we need to assume 
a specific genetic component in the evolution of lan¬ 
guage? Note that this is not simply the question of 
whether the hominid mind went through a general 
process of evolution, which resulted in a more pow¬ 
erful intelligence: this much is beyond dispute. The 
question is why we need to assume that the evolu¬ 
tionary process resulted in a genetic make-up which 
is in some way specifically suited for language. Dea¬ 
con (1997), for example, believes that such an as¬ 
sumption is unwarranted: according to him, the hu- 
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man mind went through a general process of evo¬ 
lution, which resulted in higher intelligence, capable 
of symbolic (non-linguistic) representation and 
communication. Language simply adapted itself to 
this sophisticated mind. So, why is general intelli¬ 
gence not a rich enough substratum for the cultural 
evolution of language? Why do we need to assume 
that genetic evolution/or language was a significant 
component of the process? 

3.1. Why assume cultural evolution ? 

There are many things we do not know about our an¬ 
cestors. However profound our ignorance, one thing 
seems beyond dispute: our ancestors did have cul¬ 
tural traditions, and these played a major role in their 
life. We know that traditions are ubiquitous in 
higher animals, and that they involve every aspect of 
the animal’s life: modes of foraging; criteria for se¬ 
lecting mates; ways of avoiding predators; criteria 
for choosing a habitat; practices of parental care; and 
so on (Avital and Jablonka, 2000). Traditions are 
particularly well studied in primates. Whiten et al. 
(1999), who summarize the long-term studies of 
seven populations of the co mm on chimpanzee in 
Africa, describe 39 different cultural traditions, five 
of which have something to do with communicative 
functions. As the systematic and comprehensive 
study of animal traditions has only recently started, 
this is clearly an underestimation. Our current un¬ 
derstanding of the cultures of our close extant rela¬ 
tives, as well as a whole host of archaeological find¬ 
ings from the early hominids’ period, strongly 
suggest that hominids did indeed live in cultural set¬ 
tings, and that these cultural settings reflected local, 
evolving traditions. 

Traditions are the result of the transmission, over 
generations, of patterns of behavior through social 
learning. They are the result of cultural evolution. In 
this sense, cultural evolution can be thought of as the 
change in the frequency and nature of socially- 
learned and socially-transmitted behaviors in a pop¬ 
ulation. When cultural evolution is cumulative, the 
process often leads to the gradual sophistication of 
the cultural practice, each evolutionary step building 
on previous ones. A good example of such a cumula¬ 
tive process of cultural evolution may be the evolu¬ 
tion of tool-making techniques. In a process of this 
type, a cultural innovation, invented by a single indi¬ 


vidual or by a whole group, is adopted and absorbed 
(and often extended and improved) by the other 
members of the community - most often by younger 
individuals, whose behavioral plasticity is exten¬ 
sive. Such innovations, once they are accepted by 
the community, become the behavioral norm for the 
next generation (Morgan, 1994). This behavioral 
norm may then provide the basis for the next innova¬ 
tion, and so on and so forth. The process, of course, 
need not be continuous, and may proceed by fits and 
starts. So, although we still know very little about 
the dynamics of the process, it seems quite reason¬ 
able that cumulative cultural evolution played a ma¬ 
jor role in the development of hominid’s material 
culture. 

In many respects, linguistic culture is similar to 
material culture: first, languages are extremely im¬ 
portant tools used by individuals in societies for the 
narration of stories, for the transfer of information, 
and for the construction of a socially-based world¬ 
view. Second, just like material artifacts, languages 
change constantly - in ways which are intimately re¬ 
lated to social modification and transformation. 
There is no real reason to assume that this dynamic 
aspect of language is only a very recent develop¬ 
ment. Moreover, recent experimentation with higher 
primates (e.g. Savage-Rumbaugh and Lewin, 1994) 
demonstrates that even common and pigmy chim¬ 
panzees, who do not communicate linguistically in 
the wild, are capable of developing this capacity, at 
least to some extent, when exposed to language in a 
structured cultural environment. (Researchers esti¬ 
mate that the language-tutored chimpanzees reached 
the proficiency level of two-year-old human chil¬ 
dren.) This demonstrates that the culturally-based 
capacity for language can develop, at least to some 
preliminary extent, in the absence of a specific ge¬ 
netic basis. It is thus reasonable to assume that cul¬ 
tural evolution played at least some role in the evo¬ 
lution of language. 

3.2. Why assume genetic evolution for language? 

It has been argued (e.g. Donald, 1991; Lieberman, 
1991; Deacon, 1997) that language may have 
emerged as a by-product of the evolution of homi¬ 
nid’s general intelligence (reflected in the increased 
size of hominids’ brain). According to this hypothe¬ 
sis, a host of diverse processes of selection resulted 
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in the emergence of language, but none of these pro¬ 
cesses involved the selection of properties or capaci¬ 
ties which are specifically linguistic. According to 
this hypothesis, individuals were selected through¬ 
out the evolutionary process on the basis of better 
motor control, better sequencing of motor activities, 
better memory capacity, more sophisticated social 
intelligence, more elaborate communication skills, 
better learning skills, and so on. Somewhere along 
the way, language emerged as a cultural product - 
either en bloc, in a few generations, or gradually 
through a lengthy process of cultural evolution. 

This hypothesis is parsimonious, and therefore 
attractive. It suffers, however, from several interre¬ 
lated problems. None of these problems is suffi¬ 
cient to disprove the hypothesis on its own. Taken 
together, however, they seem to suggest that at least 
a certain degree of language-specificity on the ge¬ 
netic level is necessary, beyond the obvious contri¬ 
bution of the evolution of hominid’s general intelli¬ 
gence. 

The first argument for linguistic specificity on the 
genetic level comes from language acquisition. As 
much as we disagree with Chomsky’s views on lan¬ 
guage, we acknowledge his huge contribution in 
constructing this argument. The process of language 
acquisition is unlike any other learning process we 
are aware of: it is incredibly fast and seemingly ef¬ 
fortless. Before they reach the age of two, children 
already master linguistic complexities of the type 
that adults find extremely difficult to grasp when 
they try to learn a second language (Bloom, 1993). 
Acquiring a native language does not seem to de¬ 
pend on the social status, economic privileges or 
general intelligence of the children. Language ac¬ 
quisition does not seem to depend on some special, 
conscious teaching activity on the parents’ part: 
children are known to acquire a native language in 
conditions of extreme neglect, where no adult makes 
a point of speaking to them, as long as they are ex¬ 
posed to language being spoken around them. It is 
extremely difficult to account for this achievement 
on the sole basis of the external input which children 
are exposed to. When a complex pattern of behavior 
is learned very quickly, on the basis of poor input, 
we usually assume some innate basis. For example, 
many small mammals show fear of hissing snake¬ 
like sounds, even if they were never exposed to such 
sounds before. These mammals are bom with a ner¬ 
vous system, which predisposes them to react to 


such sounds with fear. Cmcially, this innate predis¬ 
position is specific to hissing noises; it is not a 
general predisposition to associate sounds with fear. 
Unlike the fear reaction, linguistic knowledge does 
require a significant amount of learning, but the 
above general point seems to hold with respect to 
language, too: children are not such efficient and 
fast learners of other types of knowledge. The un¬ 
usual capacity to acquire language seems to be a 
specific one. 

The formation de novo of new languages is prob¬ 
ably the most extreme illustration of this fact. The 
best example of such a process is that of the develop¬ 
ment of the Nicaraguan sign language (Kegl et ah, 
1999). A group of deaf children, who were never 
exposed to sign language, and only communicated 
through a very limited system of conventionalized 
gestures, were gathered in a school in Managua with 
the goal of teaching them to lip-read. The attempt 
was a total failure: the children did not even under¬ 
stand what the goal of the exercise was. In a few 
months, however, a miraculous development oc¬ 
curred: the children developed an elaborate system 
of signs, which they used to communicate among 
themselves. When a group of younger, deaf children 
joined them, a few years later, this communication 
system rapidly evolved into a fully developed sign 
language - with a level of grammatical complexity 
similar to that of any other human language. In this 
case, the complexities of the new language could not 
have been acquired from the environment, for the 
simple reason that they were not there. It is highly 
likely that the creative formation of the language 
was based on some innately-given foundation, and 
that this foundation was language-specific. 

This achievement is all the more remarkable, be¬ 
cause the specific properties of the new language 
seem to be similar to properties which are univer¬ 
sally manifested by languages around the world. 
When a system exhibits universal features (at the be¬ 
havioral or anatomical level), despite the diversity 
of environments in which the system develops, this 
indicates that the developmental process is inde¬ 
pendent of variations in experience. Independence 
from variations in experience is often equated with 
developmental canalization. In principle, this does 
not necessarily mean that the genetic basis for cana¬ 
lized development is specific to the system. It may 
be the case, for example, that the universal proper¬ 
ties of language result from strong, general con- 
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straints on brain development in the child. This is the 
essence of Deacon’s (1997) universality argument. 
However, the fact that some of the universal proper¬ 
ties of language are so specific and complex sug¬ 
gests that the genetic basis is likely to be more spe¬ 
cific, too. The fact that all languages share the same 
core expressive envelope, and that the Nicaraguan 
children zoomed in on the same set of functions, 
cannot be easily explained on the basis of the gen¬ 
eral properties of the brain. 

In general, then, no dynamic model which we are 
aware of is capable of deriving the above phenom¬ 
ena - the patterns of language acquisition, the for¬ 
mation of new languages, the universal intricacies of 
the system - from the properties of general intelli¬ 
gence and learning. This, of course, does not mean 
that such a model is impossible, but it substantially 
weakens its feasibility. Interestingly, recent dy¬ 
namic models (e.g. Batali, 1998; Kirby, 1999, to ap¬ 
pear, a, to appear, b) provide partial demonstrations 
of the way grammatical complexities may have cul¬ 
turally emerged in evolution on the basis of pre-ex¬ 
isting social agreement on the semantic underpin¬ 
nings of the system - in line with our framework. In 
Kirby’s model, for example, individuals in the simu¬ 
lation are provided with a set of meanings, which 
they are then supposed to express by arbitrarily- 
chosen strings of sound. The individuals are as¬ 
sumed to be able to assign semantic interpretations 
to the strings produced by the others, remember 
them, compare them to previous strings, and extract 
grammatical regularities from them. After a large 
number of trials, regularities do emerge: the com¬ 
munity of individuals reaches social agreement on 
lexical items, word order and syntactic composi- 
tionality. This is indeed an impressive result. It is 
based, however, on certain presuppositions regard¬ 
ing the semantic foundation of linguistic knowl¬ 
edge, which we think of as a major part of the evolu¬ 
tionary question: within our framework, such con¬ 
stitutive semantic notions as event structure and the¬ 
matic roles, and such capacities as semantic mem¬ 
ory and interpretation, which play an explanatory 
role in Kirby’s model, are themselves objects of cul¬ 
tural evolution and genetic assimilation. 

Having established the possibility that both cul¬ 
tural and genetic evolution played a role in the evo¬ 
lution of language, we may now take the next step, 
and try to figure out how the two processes inter¬ 
acted. As we shall show, the properties of this inter¬ 


action provide the key explanatory tool for our ac¬ 
count of the evolutionary process. The major mech¬ 
anism driving this interaction is that of ‘genetic as¬ 
similation’, or ‘the Baldwin effect’. 

4. The Baldwin effect 

The Baldwin effect is the transformation, through 
Darwinian selection, of a learned response into a 
more genetically-fixed or ‘instinctive’ response. 
This process has been independently described, 
more than a 100 years ago, by Lloyd Morgan, J. M. 
Baldwin, and Fairfield Osborne, and was named the 
Baldwin effect by Simpson (1953). The crux of the 
phenomenon is this: when faced with a new chal¬ 
lenge in their environment (e.g. a new predator), in¬ 
dividuals first adapt to the challenge by learning. If 
the selective pressure is ongoing, and if the neces¬ 
sary learning process is lengthy and costly, individu¬ 
als will be selected for their ability to respond appro¬ 
priately to the challenge without the full investment 
in the learning process: in other words, individuals 
who are more ‘instinctive’ responders to the chal¬ 
lenge will be selected. 

Originally, Baldwin, Morgan and Osborne 
thought about this process in terms of the selection 
of new mutant individuals. They believed that the 
learning process allows the individuals in the popu¬ 
lation to survive long enough for congruent adaptive 
mutations to appear. C. H. Waddington, the British 
geneticist and embryologist, however, argued that 
the transformation occurs through the formation of 
new combinations of genes, following sexual re¬ 
shuffling and continuous exposure to the selective 
environment. He also believed that the same process 
leads to the transition of acquired or induced physio¬ 
logical and morphological characters into constitu- 
tively expressed characters. He called this process 
‘genetic assimilation’. Waddington experimentally 
demonstrated this process for several morphological 
and physiological characters in the fruit fly. Dro¬ 
sophila melanogaster. Thus, for example, Wadding¬ 
ton induced the development of four wings in fruit 
flies, by treating fertilized eggs with ether. He then 
selected the fraction of flies who responded to the 
ether by developing four wings, bred from them, 
treated their fertilized eggs with ether, allowed them 
to develop to the adult stage, selected again, and so 
on. In the first 20 generations, the appearance of the 
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four-wing phenotype in the seleeted line was de¬ 
pendent on the ether treatment of the eggs. After 20 
generations of systematie seleetion, however, few 
flies with four wings appeared in the selected line 
without ether treatment. The character whose devel¬ 
opment was at first dependent on external induction 
became genetically fixed and independent of the 
ether treatment (Waddington, 1953). Waddington 
showed that the ether treatment exposed variations 
in genes that could lead to an induced 4-wing pheno¬ 
type. During the 20 generations of selection, gene- 
combinations that produced an ether-induced, four¬ 
winged phenotype were constructed through the 
process of sexual reshuffling and selection, until a 
threshold was eventually crossed: a particular com¬ 
bination of genes, which enabled the development 
of four wings, now appeared with no need for an ex¬ 
ternal inducer (for a more detailed explanation of the 
experiment and its significance, see Avital and 
Jablonka, 2000). 

Waddington’s analysis was applied by Ewer 
(1956), Haldane (1959), Hardy (1965) and others, in 
an attempt to explain the evolution of behavioral in¬ 
stincts: Ewer, for example, explained the evolution 
of filial imprinting in the chick in terms of genetic 
assimilation, and Haldane used the same strategy to 
explain the innate, excited response of sheep-dog 
pups to the smell of sheep. Avital and Jablonka 
(2000) suggest that the innate fearful reaction of 
many small mammal and bird species to hissing 
snake-like noises, and the fearful reaction of spotted 
hyenas to the smell of lions, may be additional ex¬ 
amples of assimilated responses (Kruuk, 1972; Ed¬ 
munds, 1974). In all these cases, individuals were 
selected on the basis of their ability to learn to re¬ 
spond to the particular stimulus. The individuals 
who were selected were those who managed to learn 
to respond adaptively very fast - on the basis of a 
minimal number of trials. Eventually, after many 
generations of selection, some individuals could re¬ 
spond to the stimulus after a single exposure: the 
learned response became innate. It is important to 
understand that in these extreme cases, where the re¬ 
sponse to the stimulus becomes independent of 
learning, it is in fact the ability to learn the stimulus 
which was selected: throughout the process, learn¬ 
ing becomes more and more efficient and rapid, up 
to the point where it is ‘internalized’. It is also im¬ 
portant to realize that most assimilation processes do 
not end up with a completely internalized, instinc¬ 


tive response. In most cases, the assimilation pro¬ 
cess will be partial: it will significantly reduce the 
number of learning trials, and make the learning of 
the stimulus, which will still be needed to some ex¬ 
tent, much more rapid and efficient (Hinton and 
Nowlan, 1987; Behera and Nanjundiah, 1995). The 
speed of assimilation is expected to vary in different 
cases, depending on the intensity of selection, the 
number of genes involved, and the nature of their in¬ 
teractions. 

In some interesting cases, adaptation through 
learning does not only expose hidden genetic varia¬ 
tion - it also creates persistent changes in the envi¬ 
ronment. The new learned response changes the en¬ 
vironment in which the individuals and their de¬ 
scendants live. If individuals learn to avoid a new 
predator by digging burrows and hiding in them, for 
example, individuals will be selected on the basis of 
their ability to dig effectively. Crucially, however, 
the ability to dig the burrows actually creates a new 
physical environment, which means that individuals 
are now also selected for their ability to live in this 
environment. Thus, those individuals who will be 
positively selected will be both efficient diggers and 
efficient burrow-dwellers. In this case, the pressure 
to avoid the predator leads to what is called niche 
construction by the organisms - they actually con¬ 
struct the environment in which they live and in 
which they and their offspring are selected (Eewon- 
tin, 1978; Odling-Smee et ah, 1996). This makes se¬ 
lection more directional and more reliable, and en¬ 
hances its effectiveness. A famous example of this 
process in humans is the evolution of lactose-ab¬ 
sorption following the domestication of cattle, and 
the drinking of fresh milk (Durham, 1991). 

There is still more to the effects of genetic assimi¬ 
lation: it can sometimes lead to the sophistication of 
behavior. Avital and Jablonka (2000) have de¬ 
scribed three major ways in which this can occur. 
We will describe two of these ways, which will turn 
out to play important and interesting roles in our 
model of the evolution of language. The first process 
which leads to the sophistication of behavior is that 
of stretch-assimilate. 

Imagine a bird capable of reliably learning a se¬ 
quence of four consecutive acts, culminating, for ex¬ 
ample, in the building of a simple nest. Teaming be¬ 
yond this is very difficult. Assume, for the sake of 
simplicity, that there is a constraint on the learning 
capacity of this species of bird, so that improved 
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learning ability is unlikely to evolve (perhaps be- 
eause a big brain needs more energy to maintain, or 
there may be some developmental eonstraint on 
brain growth). Assume, however, that there is eon- 
sistent selection for the efficient and reliable perfor¬ 
mance of the nest-building behavior sequence so 
that one of the steps becomes genetically assimi¬ 
lated: it becomes innate. The bird now needs to learn 
only three steps, and will construct its simple nest 
much more quickly. Crucially, part of its unchanged 
learning capacity is now “liberated”. If selection for 
building good nests continues, the bird can now 
learn an additional nest-improving behavior. This 
novel behavior may be acquired in several different 
ways: individual learning through trial-and-error, 
learning from an experienced individual of another 
species, and so on. The bird may learn, for example, 
to tie the nest to the branch with plant strips, so it is 
more safe and less likely to fall when the wind 
blows. Once acquired, the new behavior will be able 
to spread in the population through social learning. 
There will now be five consecutive acts, one of 
which is innate. If building nests rapidly and effi¬ 
ciently continues to be advantageous, another previ¬ 
ously learned act can become assimilated, and yet 
another new learned one can be added, so that be¬ 
havioral sequence is lengthened by yet another step. 
It is thus possible to gradually lengthen the sequence 
of acts without changing the capacity to learn: genet¬ 
ically assimilating some previously learned behav¬ 
iors “frees” the individual to learn additional acts, 
without extending the limits set by its learning ca¬ 
pacity. This process, which involves the assimila¬ 
tion of a part of the behavioral sequence, and the re¬ 
sulting stretching of the sequence by learning may 
explain the evolution of many complex behavior 
patterns in nature. 

The second effect of genetic assimilation is that 
of the evolution of categorization. Imagine a popu¬ 
lation of monkeys, which is threatened by a new 
dangerous aerial predator, a monkey eating eagle. 
Predation pressure is very severe, and individuals do 
their best to learn to avoid the predator. Monkeys 
who are better at identifying and memorizing the 
shape of the eagle, its mode of flight, and so on, will 
have a better chance to survive. As in previous ex¬ 
amples, this process will expose hidden genetic vari¬ 
ation, which is specifically relevant to the threat. 
Now, after several monkey-generations, the selec¬ 
tion pressure may weaken, and even become negli¬ 


gible, because the population is now composed of 
individuals with a genetic make-up, which enables 
them to identify and avoid the predator much more 
efficiently than before. Note that this will result in 
partial genetic assimilation, rather than in a fully in¬ 
nate response, because the degree of assimilation 
depends on the intensity (and also on the duration) 
of the selection pressure. This result, however, has 
an interesting consequence: whereas a fully assimi¬ 
lated response would associate the avoidance re¬ 
sponse with a highly specific stimulus (the monkey 
eating eagle), partial assimilation will result in the 
association of the avoidance response with a larger, 
and much more diffuse, set of stimuli: it will include 
all aerial predators whose shape and mode of flight 
are similar enough to that of the Monkey Eating Ea¬ 
gle. Effectively, the monkeys will have formed the 
conceptual category of ‘aerial predator’. 

Avital and Jablonka (2000) suggest that a process 
of this type underlies the evolution of poison avoid¬ 
ance in rats. In the 1950s and 1960s, Garcia and his 
associates experimentally showed that rats learn to 
avoid poisonous food on the basis of its taste and 
smell, but find it difficult to learn to avoid poisonous 
food on the basis of an associated clicking sound 
(Rose, 1993). From the evolutionary point of view, 
this bias makes sense, because food-poisoning is as¬ 
sociated with taste and smell rather than with 
sounds. If during evolution, many different food 
types commonly caused food poisoning in rats, each 
particular food-type for a limited time, an associa¬ 
tion between a particular food and poisoning would 
not evolve to the point of becoming innate. How¬ 
ever, a very general bias for associating the taste and 
smell of food with subsequent gastric illness could 
be established through partial genetic assimilation: 
this is so, because the only two properties which all 
poisonous foods share are (i) the fact that they 
caused illness, and (ii) the fact that they all have a 
taste and a smell. This categorical generalization al¬ 
lows rats to behave in a rule-governed manner: 
‘avoid any food (whatever its specific taste and 
smell) if you felt sick after you ate it last’. 

To sum up: the process of genetic assimilation 
transforms learned behaviours into more geneti¬ 
cally-fixed, or ‘instinctive’ behaviours. The selec¬ 
tion pressure for more efficient learning exposes 
hidden variation in the ability to learn the specific 
behaviour, and leads to the selection of better, spe¬ 
cialised learners. As we have seen, the process may 
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also lead to the construction of a new selective envi¬ 
ronment, and to the sophistication of behaviour 
through the stretch-assimilate process, and through 
category-formation. For the process to work, some 
conditions need to be met: populations must have 
abundant genetic variation that is relevant to indi¬ 
viduals’ ability to respond to the stimuli; different 
sets of genes should become selectively relevant un¬ 
der new circumstances; phenotypically visible ge¬ 
netic variation should be able to be recruited and or¬ 
ganised into new adaptive genotypes through sexual 
reshuffling and selection; and selection for the adap¬ 
tive genotypes (genotypes that enable the more 
adaptive responses) should be maintained for sev¬ 
eral generations. What we know of the nervous sys¬ 
tem, and of the abundance of genetic variation in an¬ 
imals, not only allows us to make these assumptions, 
but also suggests that such processes must have been 
very common during evolutionary history. The evo¬ 
lution of hominids is no exception. 

The effects of genetic assimilation are all the 
more dramatic when the set of learned behaviors is 
itself developing through cultural evolution. In the 
following section, we will propose that this is the 
mechanism by which the linguistic mapping-system 
evolved. 


5. The evolutionary spiral 

Let us think about an arbitrary stage in the evolution 
of language. Assume that at this stage, stage N, a 
community of hominids used some preliminary, cul¬ 
turally-transmitted communication system - a sys¬ 
tem which mapped a very constrained set of mean¬ 
ings onto a set of phonetic markers. The meanings 
and their markers may have been very different from 
those we use in present-day languages. They may 
have included, for example, some conventionalized 
alarm-calls, emotional-social vocalizations, some 
referential lexical items, and so on. As is the case 
with other mammalian species, such signs are often 
transmitted culturally through social learning 
(Avital and Jablonka, 2000). In terms of our func¬ 
tional characterization of language, this communi¬ 
cation system was defined by a very limited expres¬ 
sive envelope. The set of meanings expressed by this 
system was obviously narrower than that of full- 
fledged human languages, but more importantly, it 
was also narrower than the set of conceptual mean¬ 


ings which individuals at this stage were capable of 
representing: in other words, individuals at this 
stage were capable of thinking and feeling much 
more than they could say. This is a very reasonable 
assumption: it has been dramatically demonstrated 
in experimental settings involving chimpanzees. As¬ 
sume that individuals at this stage used their quasi- 
linguistic system comfortably and naturally, and 
that their children comfortably acquired it. Assume 
further, that the community was characterized by a 
particular genetic constitution, which allowed for 
the acquisition and usage of the system - with the 
necessary amount of variability, some individuals 
were better at acquiring and using the system than 
others. Last but not least, assume that all members of 
the group shared the necessary cognitive precursors 
for linguistic communication: they had some pre¬ 
liminary form of a theory ofmind, some level of con¬ 
ceptualization, some implicit understanding of so¬ 
cial relations and hierarchies, and the motivation for 
information sharing. This last assumption has re¬ 
cently been scrutinized by Knight (1998) and others. 
It has been argued that language is particularly prone 
to evolutionary instability, resulting from the usage 
of language for cheating, because linguistic mes¬ 
sages are cheap, and can therefore be easily used to 
convey false information. We believe that Knight 
overestimates the role of deception in the evolution 
of linguistic communication, for at least two rea¬ 
sons: first, linguistic communication is multi-func¬ 
tional, and some of its most important functions can¬ 
not be reduced to the sharing of verifiable informa¬ 
tion: questions, requests, threats, ritualized stories, 
poems and prayers are neither true nor false. Second, 
the socially-oriented nature of linguistic communi¬ 
cation, and what we think about the cohesive na¬ 
ture of early human societies, may provide various 
effective types of controls against systematic cheat¬ 
ing (Dor and Jablonka, in preparation). At any rate, 
the assumption that early hominids may have 
shared the motivation for the social communication 
of information seems to us to be quite a reasonable 
one. 

Assume, then, that at different points in time, 
throughout stage N, individuals or groups of indi¬ 
viduals came up with linguistic innovations. We 
may imagine a very wide range of types of linguistic 
innovations, which must have occurred during the 
entire evolutionary process, and most of them must 
have happened again and again. Assume that at 
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stage N, they included new lexical items for specific 
referential meanings; some more abstract markers 
for existing and novel conceptual distinctions; new 
pragmatic conventions for linguistic communica¬ 
tion, and so on. How did these innovations come 
about? Well, like all cultural innovations, they may 
have been arrived at by accident, through conscious 
effort, or as part of social play, by individuals who 
were clever enough, or explorative, or just lucky, 
and happened to be in the right social context at the 
right time. Many of the innovators must have been 
inquisitive youngsters, and in some cases, the inno¬ 
vation may have been the result of group effort 
(think about the achievement of the Nicaraguan chil¬ 
dren). The social driving force for the innovations 
must have been associated with a growing pressure 
for better communication within the group, and this 
pressure may have been causally related to a whole 
host of processes: an increase in group size (Dunbar, 
1996); significant changes in ecological conditions; 
changes in tool usage; changes in the need for social 
co-operation; or changes in the patterns of interac¬ 
tion between different hominid populations. Differ¬ 
ent sets of causal factors may have been involved in 
each particular case, but we may confidently assume 
that we do not need to invoke a genetic explanation 
for any of the innovations: by their very definition, 
the linguistic innovations of stage N were within the 
genetically-based capacity of their inventors. 

What happened to each of these linguistic inno¬ 
vations, once it was invented? Well, if it was not too 
remote from the cognitive and linguistic world of the 
community, the innovation may have been learned 
by some of the other members of the community. 
Although only a small minority in any community is 
capable of real innovation, a much larger group of 
individuals is capable of learning to understand and 
use the innovation once it is there. Here, again, we 
do not have to invoke a genetic explanation: ho¬ 
minid cognition is extremely plastic, and social 
learning takes advantage of this plasticity. More¬ 
over, research on child language and on chimpan¬ 
zees teaches us that at every developmental stage, 
individuals’ achievements in linguistic comprehen¬ 
sion are much more impressive than their achieve¬ 
ments in linguistic production. The innovator thus 
has a good chance of being understood, especially 
by those individuals who are closely related to him 
or her - family members, close friends and so on. 
Note, moreover, that different individuals probably 


differed with respect to their ability to understand 
and use the innovation, and that at least some of this 
variability resulted from variability in individuals’ 
genetic make-up: some individuals understood the 
innovation better than others; some learned to use it 
themselves; others managed to passively compre¬ 
hend it; some others may not have been able to keep 
up with it. 

What happened, then, to the innovation once it 
was learned by a few members of the community? 
Its fate depended to a significant degree on its prop¬ 
agation and dissemination across the population. 
These, in turn, probably depended upon a large set 
of considerations, including the functional signifi¬ 
cance of the innovation, the social status of the inno¬ 
vator and the first learners, the level of social cohe¬ 
sion within the first subgroup of learners, and so on. 
We may assume that in the innovator’s own genera¬ 
tion, the propagation of the new linguistic tool was 
unstable and uncertain. Many innovations, includ¬ 
ing some very adaptive ones, probably disappeared 
at this stage, because the significance of the innova¬ 
tion can sometimes be fully appreciated only when it 
is used by a significantly large and cohesive group 
of communicators (there is positive frequency de¬ 
pendent selection, up to a point, at least). We may, 
however, assume that the innovation had a better 
chance of establishing itself after the first learners 
transmitted it to their offspring, because children 
play a significant role in the establishment of cul¬ 
tural traditions (e.g. the role of juveniles in the estab¬ 
lishment of food washing by Japanese macaques in 
Koshima island). We may also assume that for a 
long period of time, after the original invention, the 
innovation went through a process of cultural evolu¬ 
tion: it may have been improved upon in all sorts of 
ways, and it may have become conventionalized and 
streamlined, in a long dynamics of learning and re¬ 
learning (cf Kirby, 1999). 

As noted above, the chances of an innovation to 
establish itself crucially depend upon its adaptive 
value as a tool of social communication. For an in¬ 
novation to survive, its usage should be beneficial 
for the speakers who decide to adopt it. In general, 
the adaptive value of a linguistic innovation is a di¬ 
rect function of its information potential, and an in¬ 
verse function of its processing effort (Sperber and 
Wilson, 1986). An innovation carries high informa¬ 
tion potential if it allows for the transfer of more in¬ 
formation which is relevant for the community; if it 
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adds relevant elements to the expressive envelope of 
the system; and if it allows for more precise produc¬ 
tion and interpretation. An innovation requires low 
processing effort to the extent that it is relatively 
easy to acquire and use in contexts of social commu¬ 
nication. (This is why parity considerations are not 
enough: meaning-blind conventions have no infor¬ 
mation potential whatsoever.) Note that the infor¬ 
mation potential of an innovation does not necessar¬ 
ily have to be related to the practical considerations 
which are usually discussed in the literature, such as 
the efficiency of co-operation in hunting or fighting, 
or the sharing of information about the natural envi¬ 
ronment. Although these considerations are proba¬ 
bly important, it seems to us that the information po¬ 
tential of an innovation is also a social issue: it has to 
do with the sharing of social information (social re¬ 
lations, social events and social hierarchies), with 
the sharing of social narratives and myths, and with 
the construction of social epistemology (cf. Knight 
1998, and Heeschen, in press). The construction of 
social epistemology, in turn, plays an important role 
in linguistically-based social identity - which 
strengthens the adaptive value of the innovation to 
an additional degree. The semantic categories we 
discussed in section 1 of this paper - event structure, 
epistemic status, animacy - seem to be especially 
functional in this respect (see Dunbar, 1996). 

Now, for a linguistic innovation to survive and 
propagate it has to be adaptive for a sufficient 
amount of time, and preferably in a wide array of 
changing circumstances. This is especially true for 
categorical markers. New lexical items may come 
and go, depending on circumstances, but persistent 
categorical markers were those which survived 
throughout social changes over long periods of time. 
Moreover, for linguistic innovations to survive they 
have to meet the conditions set by system con¬ 
straints. These are of at least two types: first, lin¬ 
guistic innovations have to comply with psychologi¬ 
cal constraints. Those innovations which corre¬ 
sponded to pre-existing cognitive, or developmental 
biases, were probably selected, as they were the ones 
most easily learned, remembered and transmitted 
(cf Sperber, 1996). Second, as the linguistic system 
evolved, it set its own constraints on new innova¬ 
tions: they had to comply with the already estab¬ 
lished system. This means that, at least from a cer¬ 
tain point in the evolution of language, the system 


itself dictated the directional nature of its own future 
evolution. Moreover, as the system became more 
complex, it gradually set more and more constraints 
on its own ability to go through major changes. 
Thus, although full-fledged languages as we know 
them are still very flexible, the universal expressive 
envelope of human languages may not be expanding 
today at the rate we assume for the early hominid pe¬ 
riod. Note, moreover, that some types of informa¬ 
tion - such as emotional messages, or manual in¬ 
structions - were probably very effectively com¬ 
municated by non-linguistic means, such as body 
language, facial expressions, mime, song and dance, 
and so on. Linguistic innovations directed at these 
types of information did not survive (or may have 
not be invented in the first place), because the other 
means of communication rendered them unneces¬ 
sary. Thus, division of labor between the different 
systems of communication may have had a signifi¬ 
cant role in the cultural evolution of the highly-con- 
strained expressive envelope of language. 

Let us assume, then, that some of the adaptive lin¬ 
guistic innovations of stage N managed to spread 
and establish themselves in the community. This es¬ 
tablishment was very enduring, because it was both 
dependent upon, and constitutive of, the social struc¬ 
ture, and because social traditions are by their very 
nature self-perpetuating. This cultural change en¬ 
hanced the communicative capacity of individuals 
within the community, thus increasing the fitness of 
the best individual communicators, as well as the fit¬ 
ness of the entire group. Crucially, however, the es¬ 
tablishment of the innovation also raised the de¬ 
mands for social learning imposed on individuals in 
the community, they did not only have to acquire the 
new innovations in order to be able to participate in 
social communication - they also had to learn to 
look at the world in new ways, direct attention to 
new aspects of reality, process and remember new 
types of information, and so on. In short: the linguis¬ 
tic innovations which established themselves in the 
community changed the social niche, and the inhab¬ 
itants of this new niche had to adapt to it - just like 
the small mammals which dig burrows in order to 
hide from predators. In adapting themselves to the 
niche, the individuals could probably count on the 
built-in residual plasticity of their minds. Those in¬ 
dividuals, and cohesive groups of individuals, who 
made better use of the innovations for efficient com- 


EVOLUTION OF LANGUAGE 


51 


munication - for whatever cultural or social reason - 
probably benefited: they were probably reproduc- 
tively more successful than others, and more likely 
to thrive. 

Very gradually, however, the increasing cogni¬ 
tive demands set by the evolving linguistic niche 
started to expose hidden genetic variation. In our 
terms, residual plasticity was gradually stretched, 
and individuals found the accumulating linguistic 
demands more and more demanding. This process 
must have taken a long time. Eventually, however, 
after a very long period of consistent, directional 
cultural selection, genetic assimilation occurred: 
some individuals dropped out of the race; other sur¬ 
vived. The frequencies of those gene combinations 
which contributed to easier language acquisition and 
use increased in the population. Eventually, in stage 
N + 1, we could find a community whose general ge¬ 
netic make-up was such that individuals comfort¬ 
ably used the more sophisticated linguistic system, 
and children comfortably acquired it. Obviously, 
this allowed for the whole process to start all over 
again: as a result of assimilation, individuals were 
freed once again to make use of their cognitive plas¬ 
ticity, to invent and learn more linguistic innova¬ 
tions. 

What could be genetically assimilated in our tran¬ 
sition from stage N to stage N + 1 ? First, it is obvious 
that all the relevant aspects of general cognition 
were assimilated, to a certain degree at least, accord¬ 
ing to suggestions by Eieberman (1991), Donald 
(1991), Jablonka and Rechav (1996), Deacon (1997) 
and others: individuals at stage N + 1 were probably 
more intelligent, had better memories and better vol¬ 
untary control of their sound production mecha¬ 
nisms, and were probably smarter social agents. We 
believe, however, that individuals at stage N + 1 had 
a cognitive constitution which was, in some minute 
but significant ways, more biased towards the ac¬ 
quisition and usage of language than the cognitive 
constitution of individuals at stage N. The process of 
genetic assimilation, which followed the long period 
of cultural evolution in which a community became 
more and more dependent on linguistic communica¬ 
tion, and in which the survival of individuals de¬ 
pended to an increasing extent on their linguistic 
performance - must have targeted those cognitive 
capacities which were most useful for this specific 
type of behavior: some examples are the capacity of 


recognizing discrete conceptual categories, of rapid 
processing of the speech channel, of recognizing lin¬ 
guistic-communicative intent, and of lexical mem¬ 
ory. These are language-specific, and they must 
have been targeted by linguistically-driven genetic 
assimilation. 

The genetic assimilation of these capacities was 
most likely partial, rather than complete. It could not 
have led to a completely innate response, because 
the on-going process of cultural evolution made sure 
that the cultural environment to which individuals 
were adapting was constantly changing. As we have 
already indicated, this state-of-affairs must have had 
far-reaching consequences in terms of the genetic 
evolution of categorization, in our case, linguistic 
categorization: very specific innovations, such as 
the meanings of specific words or specific morpho¬ 
logical markers, were not assimilated, because they 
were too variable and context-dependent, and be¬ 
cause they changed too rapidly throughout cultural 
evolution. The partial assimilation of the innova¬ 
tions, however, resulted in a cognition biased to¬ 
wards the common denominators of the innovations 
- those elements of meaning, shared by entire sets of 
innovations, which were less variable and less con¬ 
text-dependent. Effectively, this process resulted in 
a cognition biased towards a specific set of semantic 
categories. These categories did not end up com¬ 
pletely assimilated, because cultural change still put 
a high premium on epistemic flexibility, but just like 
the partial assimilation of predator-stimuli in the 
monkey population we have discussed earlier, the 
partial assimilation of the semantic categories delin¬ 
eated a set of stimuli for which a dramatically 
smaller number of learning trials was needed. The 
process resulted in a cognition which was innately 
more attuned to a specific set of semantic categories 
and their linguistic marking, and was thus innately 
more suited for their efficient acquisition and usage. 

This process of linguistically-based genetic as¬ 
similation may have actually been related in an in¬ 
teresting way to the general evolution of human cul¬ 
ture and human conceptualization. As we have 
aheady indicated, genetic assimilation also targeted 
general intelligence. We know, after all, that there 
was no strong constraint on the evolution of the 
hominids’ general intelligence: hominid brains dou¬ 
bled in size in 2.5 million years. As the process of 
cultural and linguistic evolution constantly led to an 
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extension of the environment as perceived by the 
community, individuals were constantly faced with 
more information about the world: they could learn 
more, about more aspects of the world, because they 
could think and communicate more effectively. Thi s 
created a process of positive feedback, the more in¬ 
dividuals learned about the world, the more they 
could communicate about; and the more they could 
communicate, the more they could learn. On the one 
hand, individuals and the whole community were 
now in a position to evolve their conceptual struc¬ 
tures - with the aid of the more complex communi¬ 
cation tool - language. On the other hand, the evolu¬ 
tion of conceptual structures, and general cognitive 
tools for learning, remembering and so on, helped 
the concomitant evolution of the linguistic system. 
The linguistic system thus spiraled together with the 
conceptual system (and with the motor control sys¬ 
tem, which we have not discussed in this paper). 
This wider spiral also included a wide variety of 
non-linguistic, culturally-based evolutionary pro¬ 
cesses, which interacted with each other in complex 
ways. The process resulted both in the expansion of 
hominids’ conceptual capacities, and in the con¬ 
struction and expansion of their linguistic expres¬ 
sive envelope. 

6. Conclusion 

We started out by characterizing language as a trans¬ 
parent mapping-system, dedicated to the expression 
of a constrained subset of meanings by means of 
sound concatenation. This characterization meets 
the conditions set by the Neo-Darwinian mode of 
evolutionary explanation, as it defines language as a 
cognitive system which is both functional and 
unique. As we claimed, recent advances in linguistic 
research support this conception, in direct opposi¬ 
tion to Chomsky’s traditional conception of the for¬ 
mal, meaning-blind nature of grammar, as well as to 
the functionalists’ conception of language as a gen¬ 
eral-purpose communication system. 

We then characterized the evolution of this sys¬ 
tem - and the evolution of its social users - as the in¬ 
teraction between cultural and genetic evolution. 
We discussed the evolution of the linguistic system 
in cultural terms - as the social process of innova¬ 
tion, production, comprehension, transmission and 
propagation of linguistic conventions, in which a 


community isolates and foregrounds certain aspects 
of its epistemology, and develops social agreement 
about the means of their expression. This process re¬ 
sults in a communication system which is both func¬ 
tional, rather than merely formal, and highly-con- 
strained, rather than general-purpose, because it is 
founded on a selected subset of semantic categories. 
In each stage of this long and continuous process, 
the expressive envelope of the system expands, and 
the structural means of expression are sophisticated. 

We then discussed the genetic evolution of the 
users of this system in terms of partial genetic as¬ 
similation - resulting in a linguistically-biased cog¬ 
nition, enabling easier and more effective language 
acquisition and use. As we have claimed, partial ge¬ 
netic assimilation does not copy linguistic speci¬ 
ficities into brain structures, and it does not result in 
genes for linguistic rules. Partial genetic assimila¬ 
tion does construct a genetic make-up which allows 
for the development of a cognition biased towards 
the acquisition and use of linguistic knowledge - but 
a significant amount of learning remains mandatory. 
This conception captures both the fact that children 
seem to be innately-predisposed towards the acqui¬ 
sition (and, in more extreme cases, the invention) of 
language, and the fact that the behavioral domain- 
specificity of language does not seem to be explic¬ 
itly manifested in innately-given patterns of the 
brain. This view avoids the artificial dichotomy be¬ 
tween nature and nurture, in line with the approach 
suggested by Elman et al. (1996), and suggests that 
the question of innateness is a relative one: it is the 
question of how much, when and what type of learn¬ 
ing is necessary in each stage of the evolutionary 
process. Moreover, our view of the continuous inter¬ 
action between cultural and genetic evolution takes 
the dynamic nature of languages, and the attested 
variability between different languages, to be funda¬ 
mental properties of the evolutionary process. The 
properties of the process lead us to expect its final 
products, i.e. modern-day languages, to go on 
changing in their social contexts, and to manifest an 
interesting blend of universality and variability. 
This, of course, is exactly what we find. 

Our conception of the process renders the tradi¬ 
tional distinction between the syntactic nature of 
present-day languages, and the supposedly pre-syn¬ 
tactic nature of the so-called ‘proto-language’ theo¬ 
retically unnecessary. The traditional distinction is 
based, for example, on the development of full- 


EVOLUTION OF LANGUAGE 


53 


Hedged creole languages from rudimentary pidgins 
(cf. Bickerton, 1984, 1990). Pidgins are usually de- 
seribed as lacking in grammatical complexity, 
whereas creoles manifest the full range of grammati¬ 
cal structures characteristic of mature languages. To 
the extent that grammatical complexity is viewed as 
autonomous from meaning, this development may 
indeed be thought of as a qualitative leap from one 
type of cognitive system to another. However, to the 
extent that grammatical structures are thought of as 
semantically-determined, the rise in grammatical 
complexity may be thought of as the result of a par¬ 
allel rise in the complexity of the expressive enve¬ 
lope of the language. Obviously, this rise in com¬ 
plexity may be a remarkable one - pidgins, after all, 
are very limited tools of practical communication, 
whereas creoles can be used for writing poetry - but 
it is nevertheless a rise in the complexity of the same 
type of cognitive system. As far as the evolution of 
language is concerned, then, we are entitled to think 
about the entire evolutionary process as a gradual 
and continuous one. 

The framework developed in this paper recon¬ 
ciles the two major approaches to language evolu¬ 
tion - the one that focuses on the evolution of lan¬ 
guage as a system of social communication, and the 
one which focuses on the evolution of the structur¬ 
ally unique properties of language. According to our 
conception, the formal properties of language are a 
reflection of meaning relations, and these, in turn, 
have been selected, throughout the evolution of lan¬ 
guage, on the basis of their adaptive value in terms 
of social communication. The formal question and 
the social question are thus one and the same. 

A personal note 

Twelve years ago John Maynard Smith invited me (E. J.) to give 
a lecture in Sussex on the evolutionary effects of epigenetic in¬ 
heritance. This invitation was not due to John’s agreement with 
the ideas Marion Lamb and I had been developing on the sub¬ 
ject. On the contrary - he decidedly disagreed with many of 
them, even declaring at one point that dangerous Lamarckians 
like us should be locked in cellars (although he did compassion¬ 
ately concede our cellar to be a wine cellar). Nevertheless, John 
thought our ideas were worth a good, long argument. And we 
have been arguing ever since - with fun, friendship, and for me, 
with great profit. For, as all his friends and students know, John 
has an uncanny ability to stimulate thinking, and an insatiable 
appetite for ideas. This paper, on a subject close to John’s heart 
- the evolution of language - is written in the same argumenta¬ 
tive spirit as that which John has been so good at stimulating. 
We hope you will find it worth a good, long argument, John! 
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